A Measure of Text Formality as a Human Construct
نویسندگان
چکیده
Formality has long been of interest in the study of language and discourse, and different measures of formality have been developed to predict genre variation. However, it is unclear to what extent these formality metrics are similar to the formality construct perceived by humans. This study first investigated what linguistic features predicted the text formality as humans constructed, then developed a weighted formality model, and finally tested this measure in different approaches. The corpus of this study consisted of 390 excerpts in TASA corpus with three genres: language arts, social studies and science. The five Coh-Metrix dimensions were used to develop the weighted formality model. Results showed the weighted model perceived by humans was constructed by five dimensions as theories constructed, but each dimension contributed differently to formality construct. This formality model was evaluated through comparisons between human construct of formality, Flesch-Kincaid Grade Level, and genres. All results showed the weighted formality scores had much higher correlations with human judgments of formality than non-weighted formality scores.
منابع مشابه
Comparing Two Measures for Formality
Formality is an important dimension of language style. Texts of different genres tend to have different degrees of formality. F-score (formality-score) is a most popular measure for formality to differ genres. It uses a method of combining proportions of words of different types, with nouns, adjectives, articles and prepositions as positive elements, and adverbs, verbs and interjections as nega...
متن کاملAdjective Density as a Text Formality Characteristic for Automatic Text Classification: A Study Based on the British National Corpus
In this article, we report significant findings resulting from an investigation into the correlation between adjective density, calculated as the proportion of adjectives in word tokens, and degrees of text formality as part of an attempt to examine the potential application of adjectives in automatic text classification and identification. Correlations obtained from the training corpus will be...
متن کاملIranian Wedding Invitations in the Shifting Sands of Time
As a distinct socially constructed genre, wedding invitations (WIs) offer a fruitful site for investigating how two areas of genre knowledge (i.e., form and content) change over time under the influence of sociocultural forces. Through the examination of 100 Iranian WIs dating from 1970s to the present time, the study investigated the trajectories of change through time within the social semiot...
متن کاملFormality of Language: definition, measurement and behavioral determinants
A new concept of formality of linguistic expressions is introduced and argued to be the most important dimension of variation between styles or registers. Formality is subdivided into "deep" formality and "surface" formality. Deep formality is defined as avoidance of ambiguity by minimizing the context-dependence and fuzziness of expressions. This is achieved by explicit and precise description...
متن کاملEFL Writing Styles across Personality Traits and Gender: A Case for Iranian Academic Context
The ways individuals use words can reflect basic psychological processes, including clues to their thoughts, feelings, perceptions, and personality. This paper seeks to determine whether there is a relationship between Iranian EFL learners' writing styles and their personality and gender. It focuses on gender and two key dimensions of personality (Neuroticism and Extroversion), which were asse...
متن کامل